Characterizing the high-level content of natural images using lexical basis functions

نویسندگان

  • John A. Black
  • Kanav Kahol
  • Prem Kuchi
  • Gamal Fahmy
  • Sethuraman Panchanathan
چکیده

The performance of content-based image retrieval using low-level visual content has largely been judged to be unsatisfactory. Perceived performance could probably be improved if retrieval were based on higher-level content. However, researchers have not been very successful in bridging what is now called the "semantic gap" between low-level content detectors and higher-level visual content. This paper describes a novel "top-down" approach to bridging this semantic gap. A list of primitive words (which we call "lexical basis functions") are selected from a lexicon of the English language, and are used to characterize the higher-level content of natural outdoor images. Visual similarity between pairs of images are then "computed" based on the degree of similarity between their respective word lists. These "computed" similarities are then shown to correlate with subjectively perceived similarities between pairs of images. This demonstrates that the chosen set of lexical basis functions is able to characterize the multidimensional content (and similarity) of these image pairs in a manner that parallels their subjectively perceived content (and similarity). If a retrieval system could be designed to automatically detect the visual content represented by these basis functions, it could compute a similarity measure that would correlate with human subjective similarity rankings.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Visual Concept Derivation from Natural Scenery Images Using Lexical Basis Functions, Multidimensional Scaling, and Density Clustering

Cognitive modeling of the visual concepts used by humans to classify images is a challenging task that first requires the characterization of images in terms of low-level or mid-level features that are salient to human visual perception. To complete the cognitive model, this characterization must then be correlated with higher-level concepts that are evoked in humans as they examine images. Thi...

متن کامل

The Use of Lexical Basis Functions to Characterize Faces, and to Measure Their Perceived Similarity

Over the last decade researchers have devised algorithms that can provide similarity measures between pairs of face images. These have been somewhat successful in estimating the similarities between face images under controlled conditions. However, those similarity measures do not parallel subjective similarity, as perceived by humans. In some applications it i s important to have a similarity ...

متن کامل

Object-Oriented Method for Automatic Extraction of Road from High Resolution Satellite Images

As the information carried in a high spatial resolution image is not represented by single pixels but by meaningful image objects, which include the association of multiple pixels and their mutual relations, the object based method has become one of the most commonly used strategies for the processing of high resolution imagery. This processing comprises two fundamental and critical steps towar...

متن کامل

Characterizing sub-topical functions

In this paper, we first give a characterization of sub-topical functions with respect to their lower level sets and epigraph. Next, by using two different classes of elementary functions, we present a characterization of sub-topical functions with respect to their polar functions, and investigate the relation between polar functions and support sets of this class of functions. Finally, we obtai...

متن کامل

Tuning Shape Parameter of Radial Basis Functions in Zooming Images using Genetic Algorithm

Image zooming is one of the current issues of image processing where maintaining the quality and structure of the zoomed image is important. To zoom an image, it is necessary that the extra pixels be placed in the data of the image. Adding the data to the image must be consistent with the texture in the image and not to create artificial blocks. In this study, the required pixels are estimated ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003